Cautious Classifiers

نویسندگان

  • César Ferri
  • José Hernández-Orallo
چکیده

The evaluation and use of classifiers is based on the idea that a classifier is defined as a complete function from instances to classes. Even when probabilistic classifiers are used, these are ultimately converted into categorical classifiers that must choose one class (with more or less confidence) from a set of classes. Evaluation metrics such as accuracy/error, global cost, precision, recall, f-score, specificity, sensitivity, effectiveness, macro-average, logloss, MSE or the Area Under the ROC Curve (AUC) are usually defined for “complete” classifiers. In this paper we pursue the usefulness and evaluation of “cautious” or “partial” classifiers. A cautious classifier adds an extra class “unknown” to the set of the original classes. This “unknown” class represents the cases where the prediction is uncertain or not reliable. Now, in a cost-insensitive context, accuracy and error will not be directly related but indirectly, through the coverage index. We develop new measures, efficacy and capacity, which find a compromise between reducing the number of misclassified data (error) and reducing the number of unclassified data (abstention). Inspired by ROC analysis we introduce several techniques to choose from a set of cautious classifiers. For probabilistic classifiers we define a discretisation method for converting them into cautious classifiers by using a “caution window”. We develop new response graphs to show the way in which different classifiers behave according to the size of the window and the class bias. In a cost-sensitive context, cost matrices and confusion matrices can be directly extended to account for this new class. Moreover, we extend ROC analysis and AUC evaluations to these classifiers, by considering the degree of abstention as an additional dimension.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cautious Collective Classification

Many collective classification (CC) algorithms have been shown to increase accuracy when instances are interrelated. However, CC algorithms must be carefully applied because their use of estimated labels can in some cases decrease accuracy. In this article, we show that managing this label uncertainty through cautious algorithmic behavior is essential to achieving maximal, robust performance. F...

متن کامل

Classifier fusion in the Dempster-Shafer framework using optimized t-norm based combination rules

When combining classifiers in the Dempster-Shafer framework, Dempster’s rule is generally used. However, this rule assumes the classifiers to be independent. This paper investigates the use of other operators for combining non independent classifiers, including the cautious rule and, more generally, t-norm based rules with behavior ranging between Dempster’s rule and the cautious rule. Two stra...

متن کامل

Cautious Inference in Collective Classification

Collective classification can significantly improve accuracy by exploiting relationships among instances. Although several collective inference procedures have been reported, they have not been thoroughly evaluated for their commonalities and differences. We introduce novel generalizations of three existing algorithms that allow such algorithmic and empirical comparisons. Our generalizations pe...

متن کامل

Detection of malicious attacks by Meta classification algorithms

-------------------------------------------------------------------ABSTRACT--------------------------------------------------------------We address the problem of malicious node detection in a network based on the characteristics in the behavior of the network. This issue brings out a challenging set of research papers in the recent contributing a critical component to secure the network. This ...

متن کامل

Multi-level Classification of Emphysema in HRCT Lung Images Using Delegated Classifiers

Emphysema is a common chronic respiratory disorder characterized by the destruction of lung tissue. It is a progressive disease where the early stages are characterized by diffuse appearance of small air spaces and later stages exhibit large air spaces called bullae. A bullous region is a sharply demarcated region of emphysema. In this paper, we show that an automated texture-based system based...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004